Abstract

Let $(X_t)_{t=0}^{\infty}$ be an irreducible reversible discrete-time Markov chain on a finite state space $\Omega$. Denote its transition matrix by $P$. To avoid periodicity issues (and thus ensuring convergence to equilibrium) one often considers the continuous-time version of the chain $(X_t^{c})_{t \ge 0}$ whose kernel is given by $H_t := e^{-t}\sum_{k \ge 0} (tP)^k/k!$. Another possibility is to consider the associated averaged chain $(X_t^{\mathrm{ave}})_{t=0}^{\infty}$, whose distribution at time $t$ is obtained by replacing $P^t$ by $A_t := (P^t + P^{t+1})/2$.

A sequence of Markov chains is said to exhibit (total-variation) cutoff if the convergence to stationarity in total-variation distance is abrupt. Let $(X_t^{(n)})_{t=0}^{\infty}$ be a sequence of irreducible reversible discrete-time Markov chains. In this work we prove that the sequence of associated continuous-time chains exhibits total-variation cutoff around time $t_n$ iff the sequence of the associated averaged chains exhibits total-variation cutoff around time $t_n$. Moreover, we show that the width of the cutoff window for the sequence of associated averaged chains is at most that of the sequence of associated continuous-time chains. In fact, we establish more precise quantitative relations between the mixing-times of the continuous-time and the averaged versions of a reversible Markov chain, which provide an affirmative answer to a problem raised by Aldous and Fill ([1, Open Problem 4.17]).

1 Introduction


Generically, we shall denote the state space of a Markov chain by $\Omega$ and its stationary distribution by $\pi$. We say that the chain is finite, whenever $\Omega$ is finite. Let $(X_t)_{t=0}^{\infty}$ be an irreducible Markov chain on a finite state space $\Omega$ with transition matrix $P$ and stationary distribution $\pi$. We denote such a chain by $(\Omega,P,\pi)$. A chain $(\Omega,P,\pi)$ is called reversible if $\pi(x)P(x,y) = \pi(y)P(y,x)$, for all $x,y \in \Omega$.

We call a chain lazy, if $P(x,x) \ge 1/2$ for all $x \in \Omega$. To avoid periodicity and near-periodicity issues, one often considers the lazy version of a discrete-time Markov chain, $(X_t^{L})_{t=0}^{\infty}$, obtained by replacing $P$ with $P_L := \frac{1}{2}(I + P)$. Periodicity issues can be avoided also by considering the continuous-time version of the chain, $(X_t^{c})_{t \ge 0}$. This is a continuous-time Markov chain whose heat kernel is defined by $H_t(x,y) := \sum_{k=0}^{\infty} \frac{e^{-t}t^k}{k!}P^k(x,y)$. It is a classic result of probability theory that for any initial condition the distributions of both $X_t^{L}$ and $X_t^{c}$ converge to $\pi$ when $t$ tends to infinity. The object of the theory of mixing times of Markov chains is to study the characteristics of this convergence (see [7] for a self-contained introduction to the subject).

Since reversible Markov chains can only have period 2, one may wonder whether it suffices to average over two consecutive times in order to avoid near-periodicity issues. This motivates considering the following Markov chain. For any $t \ge 0$, denote $A_t := (P^t + P^{t+1})/2$. The averaged chain, $(X_t^{\mathrm{ave}})_{t=0}^{\infty}$, with "initial state" $x$, is a Markov chain, whose distribution at time $t \ge 0$ is $A_t(x,\cdot)$, where $A_t(x,y) := (P^t(x,y) + P^{t+1}(x,y))/2$. Equivalently, $(X_t^{\mathrm{ave}})_{t=0}^{\infty} := (X_{t+\xi})_{t=0}^{\infty}$, where $\xi$ is a Bernoulli(1/2) random variable, independent of $(X_t)_{t=0}^{\infty}$. In other words, if $X_0 \sim \mu$, the averaged chain either starts at a random position distributed according to $\mu$ (i.e. it starts "at time 0") or at a random position distributed as $\sum_x \mu(x)P(x,\cdot)$ (i.e. it starts "at time 1") with equal probability. After this, the averaged chain evolves according to the transition matrix $P$. The first to investigate the averaged chain were Peres and Sousi [9]. We review their results in the related work section.
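As a small numerical illustration of why averaging over two consecutive times removes the period-2 obstruction, the following sketch compares $P^t$ with $A_t = (P^t + P^{t+1})/2$ for simple random walk on a 4-cycle. The choice of chain and the helper names (`mat_mul`, `mat_pow`, `averaged_kernel`, `cycle_walk`) are assumptions made for illustration and do not come from the text.

```python
def mat_mul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mat_pow(p, t):
    """P^t by repeated multiplication (fine for small t)."""
    n = len(p)
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for _ in range(t):
        result = mat_mul(result, p)
    return result


def averaged_kernel(p, t):
    """A_t = (P^t + P^{t+1}) / 2, the time-t kernel of the averaged chain."""
    pt = mat_pow(p, t)
    pt1 = mat_mul(pt, p)
    return [[(x + y) / 2 for x, y in zip(row_t, row_t1)]
            for row_t, row_t1 in zip(pt, pt1)]


def cycle_walk(n):
    """Simple random walk on the n-cycle (reversible; period 2 when n is even)."""
    p = [[0.0] * n for _ in range(n)]
    for i in range(n):
        p[i][(i + 1) % n] += 0.5
        p[i][(i - 1) % n] += 0.5
    return p


P = cycle_walk(4)
row_plain = mat_pow(P, 10)[0]             # still supported on the even states
row_averaged = averaged_kernel(P, 10)[0]  # uniform over all four states
```

Here `row_plain` keeps oscillating between the two parity classes, while `row_averaged` already equals the uniform stationary distribution.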

A sequence of Markov chains is said to exhibit (total-variation) cutoff if the convergence to stationarity in total-variation distance is abrupt (throughout we consider cutoff only in total-variation). In this work we prove that given a sequence of irreducible reversible finite discrete-time Markov chains, the sequence of associated continuous-time chains exhibits total-variation cutoff around time $t_n$ iff the sequence of the associated averaged chains exhibits total-variation cutoff around time $t_n$. See Corollary 1.10 for a precise statement (we defer the formal definition of cutoff to the paragraph preceding Corollary 1.10). In fact, we establish more precise quantitative relations between the mixing times of the continuous-time and of the averaged versions of a reversible discrete-time Markov chain (namely, Theorem 1.1 and Proposition 1.8), which provide an affirmative answer to a problem raised by Aldous and Fill ([1, Open Problem 4.17], stated below). Moreover, we use them to deduce that when cutoff occurs, the width of the cutoff window for the sequence of associated averaged chains is at most that of the sequence of associated continuous-time chains (see Theorem 1.3 for a precise statement).

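We denote by $\mathrm{P}_\mu^t$ (resp. $\mathrm{P}_\mu$) the distribution of $X_t$ (resp. $(X_t)_{t=0}^{\infty}$), given that the initial distribution is $\mu$. Similarly, we denote by $\mathrm{H}_\mu^t$ (resp. $\mathrm{H}_\mu$) the distribution of $X_t^{c}$ (resp. $(X_t^{c})_{t \ge 0}$) given that $X_0^{c} \sim \mu$. Finally, we denote by $\mathrm{P}_{L,\mu}^t$ (resp. $\mathrm{P}_{L,\mu}$) the distribution of $X_t^{L}$ (resp. $(X_t^{L})_{t=0}^{\infty}$), given that $X_0^{L} \sim \mu$. When $\mu(\cdot) = 1_{\cdot = x}$, for some $x \in \Omega$, we simply write $\mathrm{P}_x^t$ (similarly, $\mathrm{H}_x^t$ and $\mathrm{P}_{L,x}^t$) and $\mathrm{P}_x$ (similarly, $\mathrm{H}_x$ and $\mathrm{P}_{L,x}$).

We denote the set of distributions on a (finite) set $\Omega$ by $\mathscr{P}(\Omega)$. For any $\mu, \nu \in \mathscr{P}(\Omega)$, their total-variation distance is defined as

\[ \|\mu - \nu\|_{\mathrm{TV}} := \frac{1}{2}\sum_{x \in \Omega} |\mu(x) - \nu(x)| = \sum_{x \in \Omega:\, \mu(x) > \nu(x)} \big(\mu(x) - \nu(x)\big). \]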

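The worst-case total-variation distance at time $t$ of the continuous-time (resp. lazy) chain is defined as

\[ d_c(t) := \max_{x \in \Omega} d_c(t,x) \quad (\text{respectively, } d_L(t) := \max_{x \in \Omega} d_L(t,x)), \]

where for every $\mu \in \mathscr{P}(\Omega)$,

\[ d_c(t,\mu) := \|\mathrm{P}_\mu(X_t^{c} \in \cdot) - \pi\|_{\mathrm{TV}} = \|\mathrm{H}_\mu^t - \pi\|_{\mathrm{TV}} \quad\text{and}\quad d_L(t,\mu) := \|\mathrm{P}_\mu(X_t^{L} \in \cdot) - \pi\|_{\mathrm{TV}} = \|\mathrm{P}_{L,\mu}^t - \pi\|_{\mathrm{TV}}. \]

The $\varepsilon$-mixing-time of the continuous-time (resp. lazy) chain is defined as

\[ t_c(\varepsilon) := \inf\{t : d_c(t) \le \varepsilon\}, \qquad t_L(\varepsilon) := \inf\{t : d_L(t) \le \varepsilon\}. \]

We also define the corresponding $\varepsilon$-mixing-times w.r.t. initial distribution $\mu$ to be

\[ t_c(\varepsilon,\mu) := \inf\{t : d_c(t,\mu) \le \varepsilon\} \quad\text{and}\quad t_L(\varepsilon,\mu) := \inf\{t : d_L(t,\mu) \le \varepsilon\}. \]

Similarly, for the averaged chain we define $d_{\mathrm{ave}}(t) := \max_{x \in \Omega} d_{\mathrm{ave}}(t,x)$, where

\[ d_{\mathrm{ave}}(t,\mu) := \|(\mathrm{P}_\mu^t + \mathrm{P}_\mu^{t+1})/2 - \pi\|_{\mathrm{TV}} = \|\mathrm{P}_\mu(X_t^{\mathrm{ave}} \in \cdot) - \pi\|_{\mathrm{TV}}. \]

The $\varepsilon$-mixing-time of the averaged chain (respectively, w.r.t. $X_0 \sim \mu$, i.e. w.r.t. $X_0^{\mathrm{ave}} \sim (\mathrm{P}_\mu^0 + \mathrm{P}_\mu^1)/2$) is denoted by

\[ t_{\mathrm{ave}}(\varepsilon) := \inf\{t : d_{\mathrm{ave}}(t) \le \varepsilon\} \quad (\text{respectively, } t_{\mathrm{ave}}(\varepsilon,\mu) := \inf\{t : d_{\mathrm{ave}}(t,\mu) \le \varepsilon\}). \]

When $\varepsilon = 1/4$ we omit it from the above notation.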
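The definitions above translate directly into a brute-force computation on a small chain. The sketch below evaluates the worst-case distance of the plain chain and of the averaged chain, and the averaged mixing time with $\varepsilon = 1/4$, for simple random walk on a 6-cycle. The example chain and all function names are illustrative assumptions, not part of the paper.

```python
def cycle_walk(n):
    """Simple random walk on the n-cycle (hypothetical example chain)."""
    p = [[0.0] * n for _ in range(n)]
    for i in range(n):
        p[i][(i + 1) % n] += 0.5
        p[i][(i - 1) % n] += 0.5
    return p


def step(mu, p):
    """One step of the chain: the row vector mu P."""
    n = len(p)
    return [sum(mu[x] * p[x][y] for x in range(n)) for y in range(n)]


def tv(mu, nu):
    """Total-variation distance (1/2) * sum_x |mu(x) - nu(x)|."""
    return 0.5 * sum(abs(a - b) for a, b in zip(mu, nu))


def law_at(p, x, t):
    """Distribution of X_t started from the state x."""
    mu = [1.0 if y == x else 0.0 for y in range(len(p))]
    for _ in range(t):
        mu = step(mu, p)
    return mu


def d_plain(p, pi, t):
    """max_x || P^t(x, .) - pi ||_TV for the non-averaged chain."""
    return max(tv(law_at(p, x, t), pi) for x in range(len(p)))


def d_ave(p, pi, t):
    """max_x || (P^t(x, .) + P^{t+1}(x, .)) / 2 - pi ||_TV."""
    worst = 0.0
    for x in range(len(p)):
        mu = law_at(p, x, t)
        nxt = step(mu, p)
        worst = max(worst, tv([(a + b) / 2 for a, b in zip(mu, nxt)], pi))
    return worst


def t_ave(p, pi, eps=0.25, t_max=10_000):
    """Smallest t with d_ave(t) <= eps (search capped at t_max)."""
    for t in range(t_max + 1):
        if d_ave(p, pi, t) <= eps:
            return t
    raise ValueError("averaged chain did not mix within t_max steps")


P = cycle_walk(6)
pi = [1 / 6] * 6
t_mix = t_ave(P, pi)
```

On this bipartite example `d_plain` never drops below 1/2, whereas `d_ave` is non-increasing in `t` and `t_mix` is finite.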

We denote $\mathbb{Z}_+ := \{n \in \mathbb{Z} : n \ge 0\}$ and $\mathbb{R}_+ := \{t \in \mathbb{R} : t \ge 0\}$. Let $\phi : \mathbb{R}_+ \to \mathbb{R}_+$ and $\psi : (0,1] \to (0,1]$. We write $\phi(t) \sim t$ if $\lim_{t \to \infty} \phi(t)/t = 1$. We write $\psi = o(1)$ if $\lim_{\varepsilon \to 0} \psi(\varepsilon) = 0$. In [1] Aldous and Fill raised the following question:

Question (Open Problem 4.17 [1]). Show that there exist $\psi : (0,1] \to (0,1]$ and $\phi : \mathbb{R}_+ \to \mathbb{Z}_+$ satisfying $\psi = o(1)$ and $\phi(t) \sim t$ such that for every finite irreducible reversible Markov chain,

\[ \forall t \ge 0, \quad d_{\mathrm{ave}}(\phi(t)) \le \psi(d_c(t)). \]

Our Theorem 1.2, which is in fact a weaker version of our main result, Theorem 1.1, solves Aldous and Fill's Problem. Denote $a \vee b := \max\{a,b\}$, $a \wedge b := \min\{a,b\}$. For every $t \in \mathbb{R}$ we denote the ceiling of $t$ by $\lceil t \rceil := \min\{z \in \mathbb{Z} : z \ge t\}$.

Definition 1.1. Let $0 < \alpha < 1/2$, $C > 0$, $t \ge 1$ and $x \in (0,1)$. We define

\[ \psi_{\alpha,C}(x) := 1 \wedge \big(x + C|\log(2x)|^{-\alpha}\big) \quad\text{and}\quad \phi_{\alpha,C}(t) := t + \big\lceil C t^{\frac{1}{2}+\alpha}\sqrt{\alpha \log t}\, \big\rceil. \]

Remark 1.2. Note that $\phi_{\alpha,C}(t) \sim t$ and $\psi_{\alpha,C} = o(1)$, for all $C > 0$ and $0 < \alpha < 1/2$.
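To get a feel for Remark 1.2, the following sketch evaluates the two functions numerically, reading Definition 1.1 as $\phi_{\alpha,C}(t) = t + \lceil C t^{1/2+\alpha}\sqrt{\alpha\log t}\rceil$ and $\psi_{\alpha,C}(x) = 1 \wedge (x + C|\log(2x)|^{-\alpha})$. That reading of the garbled formulas, the parameter values, and the names `phi` and `psi` are assumptions.

```python
import math


def phi(alpha, c, t):
    """Time change t + ceil(C * t^(1/2 + alpha) * sqrt(alpha * log t)), for t >= 1."""
    return t + math.ceil(c * t ** (0.5 + alpha) * math.sqrt(alpha * math.log(t)))


def psi(alpha, c, x):
    """Error function min(1, x + C * |log(2x)|^(-alpha)), for 0 < x < 1."""
    log_term = abs(math.log(2 * x))
    if log_term == 0.0:
        # x = 1/2: the second summand is infinite, so the minimum equals 1.
        return 1.0
    return min(1.0, x + c * log_term ** (-alpha))


# phi(t) / t approaches 1 as t grows (slowly when alpha is close to 1/2).
ratio_small = phi(0.25, 1.0, 1e4) / 1e4
ratio_large = phi(0.25, 1.0, 1e12) / 1e12

# psi(x) decreases towards 0 as x -> 0, but only at a logarithmic rate.
psi_values = [psi(0.25, 1.0, x) for x in (1e-3, 1e-10, 1e-300)]
```

The logarithmic decay of `psi` reflects the fact that a total-variation distance as small as $x$ only forces $t$ to be of order $|\log x|$.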

Theorem 1.1. There exist absolute constants $C_1, C_2, C_3 > 0$ such that for every finite irreducible reversible Markov chain $(\Omega,P,\pi)$, $\mu \in \mathscr{P}(\Omega)$, $0 < \alpha < 1/2$ and $t \ge 1$,

\[ d_L(\phi_{\alpha,C_1}(t),\mu) \le d_c(t/2,\mu) + C_2 t^{-\alpha}, \tag{1.1} \]

\[ d_{\mathrm{ave}}(\phi_{\alpha,C_1}(t),\mu) \le d_L(2t,\mu) + C_2 t^{-\alpha}, \tag{1.2} \]

\[ d_{\mathrm{ave}}(\phi_{\alpha,C_3}(t),\mu) \le d_c(t,\mu) + 2C_2 t^{-\alpha}. \tag{1.3} \]

Moreover, (1.1)-(1.3) remain valid when $\mu$ is omitted from both sides.

Note that (1.3) follows from (1.1)-(1.2) by picking $C_3$ sufficiently large in terms of $C_1$.

Remark 1.3. The converse inequality $d_c(t + 2t^{3/4}) \le d_{\mathrm{ave}}(t) + e^{-\sqrt{t}}$ is easy ((1.9)). Combined with (1.3) one can readily see that $d_c(\cdot)$ exhibits an abrupt transition iff $d_{\mathrm{ave}}(\cdot)$ exhibits an abrupt transition (in which case, both occur around the same time).

Theorem 1.2. There exist absolute constants $C_1, C_2 > 0$ such that for every finite irreducible reversible Markov chain

\[ d_{\mathrm{ave}}(\phi_{\alpha,C_1}(t)) \le \psi_{\alpha,C_2}(d_c(t)), \quad\text{for every } 0 < \alpha < 1/2 \text{ and } t \ge 2. \tag{1.4} \]

Remark 1.4. Theorem 1.2 can be rephrased as follows. There exist absolute constants $C_1, C_2 > 0$ such that for every finite irreducible reversible Markov chain,

\[ t_{\mathrm{ave}}(\psi_{\alpha,C_2}(\varepsilon)) \le \phi_{\alpha,C_1}(t_c(\varepsilon)), \quad\text{for all } 0 < \alpha < 1/2 \text{ and } 0 < \varepsilon < 1. \tag{1.5} \]

Theorem 1.2 is an immediate consequence of (1.3) together with the "worst-case" estimate $d_c(t) \ge (e^{-2t}/2)1_{|\Omega| > 1}$ (e.g. [7, Lemma 20.11]). We omit the details. Theorem 1.1 follows in turn as the particular case $s := 2 \vee C t^{\alpha}\sqrt{\alpha \log t}$ of the following proposition.

Proposition 1.5. There exists an absolute constant $C$ such that for every finite irreducible reversible chain $(\Omega,P,\pi)$, every $\mu \in \mathscr{P}(\Omega)$, $t \ge 2$ and $s \in [2, e^t]$ we have that

\[ d_L(t + \lceil s\sqrt{t}\, \rceil,\mu) \le d_c(t/2,\mu) + C s^{-1}\sqrt{\log s}, \tag{1.6} \]

\[ d_{\mathrm{ave}}(t + \lceil s\sqrt{t}\, \rceil,\mu) \le d_L(2t,\mu) + C s^{-1}\sqrt{\log s}. \tag{1.7} \]

We now make two remarks regarding the sharpness of (1.7). The first concerns the error term $C s^{-1}\sqrt{\log s}$ (and also the "error term", $\psi_{\alpha,C_2}(d_c(t)) - d_c(t)$, in (1.4)). The second concerns the "time-shift" term $\lceil s\sqrt{t}\, \rceil$.
Remark 1.6. Denote $s = s_{n,\alpha} := [\ldots]$ and $t = t_{n,\alpha} := 4n + s$. In § 6 we construct for every $0 < \alpha < 1/2$ a sequence of chains with $t_c^{(n)} = (4 \pm o(1))n$, such that for some absolute constants $c_1, c_2 > 0$ the $n$-th chain in the sequence satisfies that

\[ d_{\mathrm{ave}}(t + s) - d_c(t) \ge [\ldots] \ge \frac{c_2}{[\log(1/d_c(t))]^{[\ldots]}}. \tag{1.8} \]

Thus the inverse polynomial decay (w.r.t. $s$) in (1.7) is the correct order of decay, up to the value of the exponent.

Remark 1.7. When $s$ is fixed, the "time-shift" term $s\sqrt{t}$ in (1.7) is of order $\sqrt{t}$. This cannot be improved. To see this, consider a birth and death chain on $[n] := \{1,2,\ldots,n\}$ with $P(i+1,i) = e^{-n} = 1 - P(i+1,i+2)$ for $i \in [n-2]$ and $P(1,2) = 1 = P(n,n-1)$. Then if $r_n = o(\sqrt{n})$ we have that $d_L(2n - r_n) = 1/2 \pm o(1)$, while $d_{\mathrm{ave}}(n-3) = 1 - o(1)$.


The following proposition offers a converse to Theorem 1.1. The argument in the proof 
of (1.9) is due to Peres and Sousi ([9, Lemma 2.3]). 

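Proposition 1.8. Let $(\Omega,P,\pi)$ be a finite irreducible Markov chain. Then for every $t \in \mathbb{N}$, $0 < s \le \sqrt{t}$ and $\mu \in \mathscr{P}(\Omega)$,

\[ \begin{aligned} d_c(t + s\sqrt{t},\mu) &\le d_{\mathrm{ave}}(t,\mu) + e^{-s^2/4}, \\ d_L(2t + \lceil 2s\sqrt{t}\, \rceil,\mu) &\le d_{\mathrm{ave}}(t,\mu) + [\ldots]. \end{aligned} \tag{1.9} \]

\[ d_c(t + s\sqrt{t},\mu) \le d_L(2t,\mu) + e^{-s^2/2}. \tag{1.10} \]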
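The two continuous-time bounds can be sanity-checked numerically on a toy chain. The sketch below does this for simple random walk on a 12-cycle, reading the first line of (1.9) as $d_c(t+s\sqrt{t}) \le d_{\mathrm{ave}}(t) + e^{-s^2/4}$ and (1.10) as $d_c(t+s\sqrt{t}) \le d_L(2t) + e^{-s^2/2}$ (the error terms are the ones that the proof in § 4 produces). The chain, the truncation level of the Poisson series, and all helper names are assumptions made for illustration; passing the check on one chain is of course not a proof.

```python
import math


def cycle_walk(n):
    """Simple random walk on the n-cycle (hypothetical example chain)."""
    p = [[0.0] * n for _ in range(n)]
    for i in range(n):
        p[i][(i + 1) % n] += 0.5
        p[i][(i - 1) % n] += 0.5
    return p


def lazy(p):
    """Lazy version P_L = (I + P) / 2."""
    n = len(p)
    return [[(p[i][j] + (1.0 if i == j else 0.0)) / 2 for j in range(n)]
            for i in range(n)]


def step(mu, p):
    n = len(p)
    return [sum(mu[x] * p[x][y] for x in range(n)) for y in range(n)]


def tv(mu, nu):
    return 0.5 * sum(abs(a - b) for a, b in zip(mu, nu))


def law_at(p, x, t):
    mu = [1.0 if y == x else 0.0 for y in range(len(p))]
    for _ in range(t):
        mu = step(mu, p)
    return mu


def d_discrete(p, pi, t):
    """Worst-case distance at time t of the discrete-time chain with kernel p."""
    return max(tv(law_at(p, x, t), pi) for x in range(len(p)))


def d_ave(p, pi, t):
    worst = 0.0
    for x in range(len(p)):
        mu = law_at(p, x, t)
        nxt = step(mu, p)
        worst = max(worst, tv([(a + b) / 2 for a, b in zip(mu, nxt)], pi))
    return worst


def d_cont(p, pi, r, terms=400):
    """Worst-case distance of H_r = sum_k e^{-r} r^k / k! P^k (truncated series)."""
    worst = 0.0
    for x in range(len(p)):
        mu = [1.0 if y == x else 0.0 for y in range(len(p))]
        acc = [0.0] * len(p)
        weight = math.exp(-r)
        for k in range(terms):
            acc = [a + weight * m for a, m in zip(acc, mu)]
            mu = step(mu, p)
            weight *= r / (k + 1)
        worst = max(worst, tv(acc, pi))
    return worst


P = cycle_walk(12)
pi = [1 / 12] * 12
PL = lazy(P)
results = []
for t in (4, 9, 16):
    for s in (0.5, 1.0, 2.0):  # every pair satisfies 0 < s <= sqrt(t)
        lhs = d_cont(P, pi, t + s * math.sqrt(t))
        bound_ave = d_ave(P, pi, t) + math.exp(-s * s / 4)
        bound_lazy = d_discrete(PL, pi, 2 * t) + math.exp(-s * s / 2)
        results.append((lhs, bound_ave, bound_lazy))
```

Each tuple in `results` holds the left-hand side followed by the two upper bounds.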


Remark 1.9. In [11] p. 195, it is written: "a theorem is Abelian if it says something about an average of a sequence from a hypothesis about its ordinary limit; it is Tauberian if conversely the implication goes from average to limit".

Proposition 1.8 is easier and more general than our Theorem 1.1 (as it does not assume reversibility) because it is an Abelian theorem, while our Theorem 1.1 is Tauberian, hence requires the reversibility assumption, as we now demonstrate. One (non-reversible) instance in which (1.7) fails is a biased random walk on the $n$-cycle with $P(i,i-1) = n^{-\ell} = 1 - P(i,i+1)$, where $i-1$ and $i+1$ are defined modulo $n$ and $\ell > 0$ is arbitrary. In this example $t_L(\varepsilon)/(n^2|\log \varepsilon|) = \Theta(1)$, however $t_{\mathrm{ave}}(\varepsilon)/(n^{\ell+2}|\log \varepsilon|) = \Theta(1)$ (uniformly in $\varepsilon \in (0,1/2]$).

Next, consider a sequence of such chains, $((\Omega_n, P_n, \pi_n) : n \in \mathbb{N})$, each with its corresponding worst-distance from stationarity $d_n(t)$, its mixing-time $t_c^{(n)}$, etc. Loosely speaking, the (total-variation) cutoff phenomenon occurs when over a negligible period of time, known as the cutoff window, the worst-case total-variation distance drops abruptly from a value close to 1 to near 0. In other words, one should run the $n$-th chain until the cutoff time for it to even slightly mix in total-variation, whereas running it any further is essentially redundant. Formally, we say that a sequence of chains exhibits a continuous-time cutoff if the following sharp transition in its convergence to stationarity occurs:

\[ \lim_{n \to \infty} t_c^{(n)}(\varepsilon)/t_c^{(n)}(1-\varepsilon) = 1, \quad\text{for every } 0 < \varepsilon < 1. \]

We say that a sequence of chains exhibits an averaged cutoff (resp. lazy cutoff) if

\[ \lim_{n \to \infty} t_{\mathrm{ave}}^{(n)}(\varepsilon)/t_{\mathrm{ave}}^{(n)}(1-\varepsilon) = 1 \quad (\text{resp., } \lim_{n \to \infty} t_L^{(n)}(\varepsilon)/t_L^{(n)}(1-\varepsilon) = 1), \quad\text{for every } 0 < \varepsilon < 1. \]

The following corollary follows at once from Theorem 1.1 together with Proposition 1.8.

Corollary 1.10. Let $(\Omega_n, P_n, \pi_n)$ be a sequence of finite irreducible reversible Markov chains. Then the following are equivalent:

(i) The sequence exhibits a continuous-time cutoff.

(ii) The sequence exhibits a lazy cutoff.

(iii) The sequence exhibits an averaged cutoff.

Moreover, if (i) holds, then $\lim_{n \to \infty} t_{\mathrm{ave}}^{(n)}/t_c^{(n)} = \lim_{n \to \infty} t_L^{(n)}/(2t_c^{(n)}) = 1$.

Remark 1.11. The equivalence between (i) and (iii) was previously unknown. In [4] it was shown that (i) and (ii) are equivalent even without the assumption of reversibility.

Our last point of comparison is related to the width of the cutoff window. We say that a sequence of chains exhibits a continuous-time (resp. averaged) cutoff with a cutoff window $w_n$ if $w_n = o(t_c^{(n)})$ (resp. $w_n = o(t_{\mathrm{ave}}^{(n)})$) and for every $0 < \varepsilon < 1/4$ there exists some constant $C_\varepsilon > 0$ (depending only on $\varepsilon$) such that

\[ \forall n, \quad t_c^{(n)}(\varepsilon) - t_c^{(n)}(1-\varepsilon) \le C_\varepsilon w_n \quad (\text{resp. } t_{\mathrm{ave}}^{(n)}(\varepsilon) - t_{\mathrm{ave}}^{(n)}(1-\varepsilon) \le C_\varepsilon w_n). \]

One can define the notion of a cutoff window for a sequence of associated lazy chains in an analogous manner. Note that the window defined in this manner is not unique.

Theorem 1.3. Let $(\Omega_n, P_n, \pi_n)$ be a sequence of finite irreducible reversible Markov chains.

(i) Assume that the sequence exhibits a continuous-time cutoff with a window $w_n$. Then it exhibits also an averaged cutoff with a window $w_n$.

(ii) Assume that the sequence exhibits an averaged cutoff with a window $w_n$. Then it exhibits also a continuous-time cutoff with a window $w_n' := w_n \vee \sqrt{t_{\mathrm{ave}}^{(n)}}$.

Theorem 1.3 follows easily from Propositions 1.5 and 1.8 in conjunction with the following result. We prove Theorem 1.3 in § 5 for the sake of completeness.

Proposition 1.12 ([4] Chen and Saloff-Coste). Let $(\Omega_n, P_n, \pi_n)$ be a sequence of finite irreducible reversible Markov chains. The sequence exhibits a continuous-time cutoff with a window $w_n$ iff it exhibits a lazy cutoff with a window $w_n$, in which case $w_n = \Omega\big(\sqrt{t_c^{(n)}}\big)$.

Remark 1.13. There are cases in which the cutoff window for the sequence of the associated averaged chains can be much smaller than that of the associated continuous-time chains. For instance, let $G_n$ be a sequence of random $n$-vertex $d_n$-regular graphs, for some $d_n$ such that $\log n \ll d_n \log d_n = [\ldots]$. Let $(X_t^{(n)})_{t=0}^{\infty}$ be the sequence of discrete-time simple random walks on $G_n$. Then [8] w.h.p. (i.e. with probability $1 - o(1)$, over the choice of the graphs)

\[ \big| t_{\mathrm{ave}}^{(n)}(\varepsilon) - \lceil \log_{d_n - 1}(d_n n) \rceil \big| \le 1, \quad\text{for every } \varepsilon \in (0,1), \]

while the cutoff window for the sequence of associated continuous-time chains is $[\ldots]$.

1.1 Related work

This work was greatly motivated by the results of Peres and Sousi in [9] about the averaged chain. Their approach relied on the theory of random times to stationarity combined with a certain "de-randomization" argument which shows that for every finite irreducible reversible Markov chain and every stopping time $T$ such that $X_T \sim \pi$, $t_{\mathrm{ave}} \le 220 \max_{x \in \Omega} \mathbb{E}_x[T]$. As a consequence, they showed that for all $\alpha \in (0,1/2)$ (this was extended to $\alpha = 1/2$ in [5]), there exist constants $c_\alpha, c_\alpha' > 0$ such that for every lazy finite irreducible reversible chain

\[ c_\alpha' t_H(\alpha) \le t_{\mathrm{ave}} \le c_\alpha t_H(\alpha), \quad\text{where}\quad t_H(\alpha) := \max_{x \in \Omega,\, A \subset \Omega:\, \pi(A) \ge \alpha} \mathbb{E}_x[T_A] \quad\text{and}\quad T_A := \inf\{t : X_t \in A\}. \]

Using this, they showed that there exist some absolute constants $C_1, C_2 > 0$ such that

\[ C_1 t_L \le t_{\mathrm{ave}} \le C_2 t_L. \]

Implicitly, they showed that for every $0 < \varepsilon < 1/4$ and $0 < \alpha < 1/2$,

\[ t_{\mathrm{ave}}(\varepsilon) \le [\ldots]. \]

This was the first progress towards resolving Aldous and Fill's Open Problem. Alas, this is too coarse for the purpose of resolving it.

Our approach, which is somewhat similar to that taken in [3], is more direct than that taken in [9]. As in [3], where Starr's maximal inequality was used to obtain a characterization of the cutoff phenomenon for reversible Markov chains, the key ingredient in the proof of Proposition 1.5 is a maximal inequality, due to Stein [10] (see (2.1)).

2 A maximal inequality 

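In this section we state maximal inequalities which shall be utilized in the proof of the main results. We start with a few basic definitions.

Definition 2.1. Let $(\Omega,P,\pi)$ be a finite reversible chain. For $f \in \mathbb{R}^{\Omega}$, let

\[ \mathbb{E}_\pi[f] := \sum_{x \in \Omega} \pi(x)f(x) \quad\text{and}\quad \mathrm{Var}_\pi f := \mathbb{E}_\pi[(f - \mathbb{E}_\pi f)^2]. \]

The inner-product $\langle \cdot,\cdot \rangle_\pi$ and $L^p$ norm are

\[ \langle f,g \rangle_\pi := \mathbb{E}_\pi[fg] \quad\text{and}\quad \|f\|_p := (\mathbb{E}_\pi[|f|^p])^{1/p}, \quad 1 \le p < \infty. \]

We identify $P^t$, $P_L^t$, $A_t$, $H_t$ with the linear operators on $L^2(\mathbb{R}^{\Omega},\pi)$ given by

\[ A_t f(x) := \sum_{y \in \Omega} A_t(x,y)f(y) = \mathbb{E}_x[f(X_t^{\mathrm{ave}})], \qquad H_t f(x) := \sum_{y \in \Omega} H_t(x,y)f(y) = \mathbb{E}_x[f(X_t^{c})], \]

\[ P^t f(x) := \sum_{y \in \Omega} P^t(x,y)f(y) = \mathbb{E}_x[f(X_t)] \quad\text{and}\quad P_L^t f(x) := \sum_{y \in \Omega} P_L^t(x,y)f(y) = \mathbb{E}_x[f(X_t^{L})]. \]

By reversibility $P^t, P_L^t, A_t, H_t : L^2 \to L^2$ are all self-adjoint (w.r.t. $\langle \cdot,\cdot \rangle_\pi$).

Definition 2.2. Let $P$ be a linear operator and $k \in \mathbb{Z}_+$. We define $\Delta P^k := P^{k+1} - P^k = P^k(P - I)$. For $r > 1$, we define inductively $\Delta^r P^k := \Delta(\Delta^{r-1}P^k) = \Delta^{r-1}P^{k+1} - \Delta^{r-1}P^k = P^k(P - I)^r$. Similarly, we define $\Delta A_k := A_{k+1} - A_k = \frac{1}{2}P^k(P^2 - I)$.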
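The identity $\Delta A_k = A_{k+1} - A_k = \frac{1}{2}P^k(P^2 - I)$, which is what lets the averaged chain be analysed through the positive operator $P^2$, can be checked numerically. The sketch below does so on a three-state birth and death chain; the chain and the helper names are illustrative assumptions.

```python
def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mat_sub(a, b):
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def mat_pow(p, k):
    out = identity(len(p))
    for _ in range(k):
        out = mat_mul(out, p)
    return out


def averaged_kernel(p, k):
    """A_k = (P^k + P^{k+1}) / 2."""
    pk = mat_pow(p, k)
    pk1 = mat_mul(pk, p)
    return [[(x + y) / 2 for x, y in zip(ra, rb)] for ra, rb in zip(pk, pk1)]


# A birth and death chain on {0, 1, 2}; such chains are always reversible.
P = [[0.0, 1.0, 0.0],
     [0.25, 0.25, 0.5],
     [0.0, 1.0, 0.0]]


def max_gap(k):
    """Largest entrywise difference between A_{k+1} - A_k and P^k (P^2 - I) / 2."""
    lhs = mat_sub(averaged_kernel(P, k + 1), averaged_kernel(P, k))
    p2_minus_i = mat_sub(mat_mul(P, P), identity(len(P)))
    rhs = [[x / 2 for x in row] for row in mat_mul(mat_pow(P, k), p2_minus_i)]
    return max(abs(x - y) for ra, rb in zip(lhs, rhs) for x, y in zip(ra, rb))


gaps = [max_gap(k) for k in range(6)]
```

All entries of `gaps` vanish up to floating-point rounding.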

Let $(\Omega,\mu)$ be a probability space. Let $P : L^2(\Omega,\mu) \to L^2(\Omega,\mu)$ be a positive (i.e. $f \ge 0 \implies Pf \ge 0$) self-adjoint linear operator whose spectrum is contained in the interval $[0,1]$. It is noted in [10] that for all $r \ge 1$, there exists a constant $C_r$ (independent of $(\Omega,\mu)$ and $P$), such that for every $f \in L^2(\Omega,\mu)$,

\[ \Big\| \sup_{t \ge 0} (t+1)^r |\Delta^r P^t f| \Big\|_2 \le C_r \|f\|_2. \tag{2.1} \]

In [6] Stein's argument is extended to the setup where $P$ is a positive contraction with $M(P) := \sup_t t\|P^{t+1} - P^t\|_2 < \infty$ without the assumptions that $P$ is self-adjoint and that its spectrum is contained in $[0,1]$. In this more general setup $C_r$ depends also on $M(P)$.


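Corollary 2.3. There exists an absolute constant $C$ such that for every finite irreducible reversible Markov chain $(\Omega,P,\pi)$ and every $f \in \mathbb{R}^{\Omega}$,

\[ \Big\| \sup_{t \ge 0} (t+1)\,|\Delta P_L^t f| \Big\|_2^2 \le C\,\mathrm{Var}_\pi f \quad\text{and}\quad \Big\| \sup_{t \ge 0} (t+1)\,|\Delta A_t f| \Big\|_2^2 \le C\,\mathrm{Var}_\pi f. \tag{2.2} \]

Proof: Note that $\Delta A_{2t} f = \frac{1}{2}\Delta (P^2)^t f$ and $\Delta A_{2t+1} f = \frac{1}{2}\Delta (P^2)^t (Pf)$. Hence (2.2) follows from (2.1) applied to $P_L$ and $P^2$ by noting that $\Delta P_L^t f = \Delta P_L^t(f - \mathbb{E}_\pi[f])$, $\Delta A_t f = \Delta A_t(f - \mathbb{E}_\pi[f])$ and $\mathrm{Var}_\pi(Pf) \le \mathrm{Var}_\pi f$. □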


3 Proof of Proposition 1.5. 

In this section we prove Proposition 1.5. As noted in the introduction, Theorem 1.1 follows 
as a particular case of Proposition 1.5 and Theorem 1.2, in turn, follows in a trivial manner 
from Theorem 1.1. We now state large deviation estimates for the Poisson and Binomial 
distributions. For a proof see e.g. [2, Appendix A]. 

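Fact 3.1. Let $Y \sim \mathrm{Pois}(\mu)$ and let $Y' \sim \mathrm{Bin}(t,1/2)$. Then for every $\varepsilon > 0$ we have that

\[ \mathbb{P}[Y \le \mu(1-\varepsilon)] \le \exp\Big(-\frac{\varepsilon^2\mu}{2}\Big), \qquad \mathbb{P}[Y \ge \mu(1+\varepsilon)] \le \exp\Big(-\frac{\varepsilon^2\mu}{2(1+\varepsilon/3)}\Big), \tag{3.1} \]

\[ \mathbb{P}[Y' \le t(1-\varepsilon)/2] = \mathbb{P}[Y' \ge t(1+\varepsilon)/2] \le \exp\Big(-\frac{\varepsilon^2 t}{2}\Big). \]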

Let $(N(t))_{t \ge 0}$ and $(M(t))_{t \ge 0}$ be homogeneous Poisson processes with rate 1, such that $(N(t))_{t \ge 0}$, $(M(t))_{t \ge 0}$ and $(X_t)_{t=0}^{\infty}$ are mutually independent. We define

\[ N_L(t) := N(t) + M(t) \quad\text{and}\quad S(\ell) := \sum_{k=1}^{\ell} \eta_k \sim \mathrm{Bin}(\ell,1/2), \]

where $\eta_k := 1_{N(T_k) > N(T_{k-1})}$ and $T_k := \inf\{t : N_L(t) = k\}$.

Let $(\Omega,P,\pi)$ be a Markov chain. The natural coupling of $(X_t)_{t \in \mathbb{Z}_+}$, $(X_t^{L})_{t \in \mathbb{Z}_+}$ and $(X_t^{c})_{t \ge 0}$ is defined by setting $X_t^{L} := X_{S(t)}$ and $X_t^{c} := X_{N(t)} = X^{L}_{N_L(t)}$.

As can be seen from the natural coupling, $H_t = \sum_{k \ge 0} \frac{e^{-2t}(2t)^k}{k!}P_L^k$. This also follows from Poisson thinning. Also, in the natural coupling $(X_t^{L})_{t \in \mathbb{Z}_+}$ and $(N_L(t))_{t \ge 0}$ are independent. The same holds for $(X_t)_{t \in \mathbb{Z}_+}$ and $(S(t))_{t=0}^{\infty}$. The next lemma follows from the natural coupling by a standard construction (cf. the proofs of Proposition 4.7 and Theorem 5.2 in [7]).
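The Poisson-thinning identity $H_t = \sum_{k \ge 0} \frac{e^{-t}t^k}{k!}P^k = \sum_{k \ge 0} \frac{e^{-2t}(2t)^k}{k!}P_L^k$, with $P_L = (I+P)/2$, is easy to confirm numerically by truncating both series. The sketch below does this for a three-state birth and death chain; the chain, the truncation level and the helper names are assumptions chosen for illustration.

```python
import math


def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def poisson_mixture(p, rate, terms=200):
    """Truncated series sum_{k < terms} e^{-rate} rate^k / k! * p^k."""
    n = len(p)
    total = [[0.0] * n for _ in range(n)]
    power = identity(n)
    weight = math.exp(-rate)  # Poisson(rate) probability of the value 0
    for k in range(terms):
        for i in range(n):
            for j in range(n):
                total[i][j] += weight * power[i][j]
        power = mat_mul(power, p)
        weight *= rate / (k + 1)  # Poisson(rate) probability of k + 1
    return total


P = [[0.0, 1.0, 0.0],
     [0.25, 0.25, 0.5],
     [0.0, 1.0, 0.0]]
P_lazy = [[(P[i][j] + (1.0 if i == j else 0.0)) / 2 for j in range(3)]
          for i in range(3)]

t = 3.0
H_direct = poisson_mixture(P, t)              # Poisson(t) many steps of P
H_via_lazy = poisson_mixture(P_lazy, 2 * t)   # Poisson(2t) many steps of P_L
gap = max(abs(x - y)
          for ra, rb in zip(H_direct, H_via_lazy)
          for x, y in zip(ra, rb))
```

The two kernels agree up to rounding, which is the algebraic content of $e^{-2t}\exp(2t(I+P)/2) = e^{-t}\exp(tP)$.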

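Lemma 3.2. Let $(\Omega,P,\pi)$ be a finite irreducible Markov chain. Let $\mu \in \mathscr{P}(\Omega)$ and $t \in \mathbb{R}_+$.

(1) There exists a coupling $((Y_i^{L})_{i \in \mathbb{Z}_+}, (Z_i^{L,\pi})_{i \in \mathbb{Z}_+}, \xi_t)$ such that $(Y_i^{L})_{i \in \mathbb{Z}_+} \sim \mathrm{P}_{L,\mu}$, $(Z_i^{L,\pi})_{i \in \mathbb{Z}_+} \sim \mathrm{P}_{L,\pi}$ (the law of the stationary lazy chain), $\xi_t \sim \mathrm{Pois}(t)$, in which $\xi_t$ and $(Y_i^{L})_{i \in \mathbb{Z}_+}$ are independent and

\[ \mathbb{P}[Y^{L}_{\xi_t} = Z_0^{L,\pi}] = \mathbb{P}[Y^{L}_{\xi_t + i} = Z_i^{L,\pi} \text{ for all } i \ge 0] = 1 - d_c(t/2,\mu). \]

(2) There exists a coupling $((Y_i)_{i \in \mathbb{Z}_+}, (Z_i^{\pi})_{i \in \mathbb{Z}_+}, \xi_t')$, such that $(Y_i)_{i \in \mathbb{Z}_+} \sim \mathrm{P}_\mu$, $(Z_i^{\pi})_{i \in \mathbb{Z}_+} \sim \mathrm{P}_\pi$ (the law of the stationary chain), $\xi_t' \sim \mathrm{Bin}(2t,1/2)$, in which $\xi_t'$ and $(Y_i)_{i \in \mathbb{Z}_+}$ are independent and

\[ \mathbb{P}[Y_{\xi_t'} = Z_0^{\pi}] = \mathbb{P}[Y_{\xi_t' + i} = Z_i^{\pi} \text{ for all } i \ge 0] = 1 - d_L(2t,\mu). \]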


Definition 3.3. Let $t \ge 1$ and $s \in [2, e^t]$. Denote

\[ \begin{aligned} r = r_{s,t} &:= 2\sqrt{2t\log s}, \\ J = J_{s,t} &:= [(t-r) \vee 0,\, t+r], \\ m = m_{s,t} &:= \lceil r(\sqrt{s} + 1) \rceil. \end{aligned} \tag{3.2} \]

In the notation of Lemma 3.2 (with both couplings taken w.r.t. time $t$), let $G$ be the event that $Y^{L}_{\xi_t + i} = Z_i^{L,\pi}$ for all $i \ge 0$ and that $\xi_t \in J$. Similarly, let $G'$ be the event that $Y_{\xi_t' + i} = Z_i^{\pi}$ for all $i \ge 0$ and that $\xi_t' \in J$.

In the following proposition, we only care about (3.5) and (3.8) (which imply (1.6) and (1.7), respectively; i.e. the below proposition implies Proposition 1.5). We present the rest of the equations in order to make it clear that (3.8) is obtained in an analogous manner to (3.5). Thus, we shall only prove part (i) of Proposition 3.4.

In the notation of Definition 3.3, the term $d_c(t/2,\mu) + 2/s^2$ appearing in (3.3) and (3.5) (resp. $d_L(2t,\mu) + 2/s^2$ appearing in (3.6) and (3.8)) is an upper bound on the probability that $G$ (resp. $G'$) fails (where the term $2/s^2$ is obtained via Fact 3.1).

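Proposition 3.4. Let $(\Omega,P,\pi)$ be a finite irreducible reversible chain. Let $\mu \in \mathscr{P}(\Omega)$. Let $B \subset \Omega$. Let $t \ge 1$ and $2 \le s \le e^t$. In the notation of Definition 3.3,

(i) Let $\eta^{L} := 1_{Y^{L}_{t+m} \in B}$ and $\eta^{L,\pi} := 1_{Z^{L,\pi}_{m} \in B}$ (where $m = \lceil r(\sqrt{s} + 1) \rceil$, $r = 2\sqrt{2t\log s}$). Then

\[ \pi(B) - \mathrm{P}_\mu[X^{L}_{t+m} \in B] \le \frac{2}{s^2} + d_c(t/2,\mu) + \mathbb{E}[(\eta^{L,\pi} - \eta^{L})1_G]. \tag{3.3} \]

\[ \big|\mathbb{E}[(\eta^{L,\pi} - \eta^{L})1_G]\big|^2 \le s^{-1}\,\mathbb{E}_\pi\Big[\sup_{i \ge r\sqrt{s}} i^2 |\Delta P_L^i 1_B|^2\Big] \le C s^{-1}\,\mathrm{Var}_\pi 1_B \le \frac{C}{s}. \tag{3.4} \]

Consequently,

\[ d_L(t+m,\mu) \le d_c(t/2,\mu) + \frac{2}{s^2} + \sqrt{C/s}. \tag{3.5} \]

(ii) Let $\omega \sim \mathrm{Bernoulli}(1/2)$ be independent of $((Y_i)_{i \in \mathbb{Z}_+}, (Z_i^{\pi})_{i \in \mathbb{Z}_+}, \xi_t')$. Let $\eta := 1_{Y_{t+m+\omega} \in B}$ and $\eta^{\pi} := 1_{Z^{\pi}_{m+\omega} \in B}$. Then

\[ \pi(B) - \mathrm{P}_\mu[X^{\mathrm{ave}}_{t+m} \in B] \le \frac{2}{s^2} + d_L(2t,\mu) + \mathbb{E}[(\eta^{\pi} - \eta)1_{G'}]. \tag{3.6} \]

\[ \big|\mathbb{E}[(\eta - \eta^{\pi})1_{G'}]\big|^2 \le s^{-1}\,\mathbb{E}_\pi\Big[\sup_{i \ge r\sqrt{s}} i^2 |\Delta A_i 1_B|^2\Big] \le C s^{-1}\,\mathrm{Var}_\pi 1_B \le \frac{C}{s}. \tag{3.7} \]

Consequently,

\[ d_{\mathrm{ave}}(t+m,\mu) \le d_L(2t,\mu) + \frac{2}{s^2} + \sqrt{C/s}. \tag{3.8} \]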

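Proof. We first note that (3.5) follows from (3.3)-(3.4) by maximizing over $B \subset \Omega$. We now prove (3.3). Let $B \subset \Omega$. Let $r$, $J$ and $m$ be as in Definition 3.3. By Fact 3.1 and our assumption that $s \le e^t$ (which implies that $\varepsilon := r/t = 2\sqrt{2t^{-1}\log s} \le 3$),

\[ \mathbb{P}[\xi_t \notin J] \le \mathbb{P}[\xi_t < t-r] + \mathbb{P}[\xi_t > t+r] \le e^{-4\log s} + e^{-2\log s} \le 2s^{-2}. \]

Hence $1 - \mathbb{P}[G] \le d_c(t/2,\mu) + 2s^{-2}$, which implies (3.3), as

\[ \pi(B) - \mathrm{P}_\mu[X^{L}_{t+m} \in B] \le 1 - \mathbb{P}[G] + \mathbb{P}[G \cap \{Z^{L,\pi}_{m} \in B\}] - \mathbb{P}[G \cap \{Y^{L}_{t+m} \in B\}] = 1 - \mathbb{P}[G] + \mathbb{E}[(\eta^{L,\pi} - \eta^{L})1_G]. \]

We now argue that for every $x \in \Omega$,

\[ \big|\mathbb{E}[\eta^{L} - \eta^{L,\pi} \mid G,\, Y^{L}_{\xi_t} = x]\big| \le \frac{1}{\sqrt{s}} \sup_{i \ge r\sqrt{s}} i\,|\Delta P_L^i 1_B(x)|. \tag{3.9} \]

Indeed, for every $x \in \Omega$ and $j \in J$

\[ \mathbb{E}[\eta^{L} \mid \xi_t = j,\, Y^{L}_{j} = x = Z_0^{L,\pi}] = P_L^{t+m-j}1_B(x), \qquad \mathbb{E}[\eta^{L,\pi} \mid \xi_t = j,\, Y^{L}_{j} = x = Z_0^{L,\pi}] = P_L^{m}1_B(x). \]

Thus by the triangle inequality

\[ \big|\mathbb{E}[\eta^{L} - \eta^{L,\pi} \mid \xi_t = j,\, Y^{L}_{j} = x = Z_0^{L,\pi}]\big| = |P_L^{t+m-j}1_B(x) - P_L^{m}1_B(x)| \le \sum_{i=(t+m-j) \wedge m}^{[(t+m-j) \vee m] - 1} |\Delta P_L^i 1_B(x)|. \tag{3.10} \]

Note that by the definition of $m = \lceil r(\sqrt{s} + 1) \rceil$ and $J = [(t-r) \vee 0,\, t+r]$, for every $j \in J$ we have that $|j - t| \le r$ and $(t+m-j) \wedge m \ge r\sqrt{s}$. Whence,

\[ \sum_{i=(t+m-j) \wedge m}^{[(t+m-j) \vee m] - 1} |\Delta P_L^i 1_B(x)| \le r \sup_{i \ge r\sqrt{s}} |\Delta P_L^i 1_B(x)| \le \frac{r}{r\sqrt{s}} \sup_{i \ge r\sqrt{s}} i\,|\Delta P_L^i 1_B(x)| = s^{-1/2} \sup_{i \ge r\sqrt{s}} i\,|\Delta P_L^i 1_B(x)|. \]

Plugging this estimate in (3.10) and averaging over $j$ yields (3.9).

Since

\[ \big|\mathbb{E}[(\eta^{L} - \eta^{L,\pi})1_G]\big| \le \mathbb{E}\big[\big|\mathbb{E}[(\eta^{L} - \eta^{L,\pi})1_G \mid Z_0^{L,\pi}]\big|\big], \]

averaging (3.9) over $Y^{L}_{\xi_t}$, and using the fact that $\mathbb{P}[G \cap \{Y^{L}_{\xi_t} = x = Z_0^{L,\pi}\}] \le \pi(x)$, for all $x$, together with Jensen's inequality and (2.2), we get that

\[ \big|\mathbb{E}[(\eta^{L} - \eta^{L,\pi})1_G]\big|^2 \le \frac{1}{s}\Big(\mathbb{E}_\pi\Big[\sup_{i \ge r\sqrt{s}} i\,|\Delta P_L^i 1_B|\Big]\Big)^2 \le \frac{1}{s}\,\mathbb{E}_\pi\Big[\sup_{i \ge r\sqrt{s}} i^2 |\Delta P_L^i 1_B|^2\Big] \le C s^{-1}\,\mathrm{Var}_\pi 1_B \le C/s. \]

□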


4 Proof of Proposition 1.8 


We start the section by stating a standard fact. 

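Claim 4.1. Let $(\Omega,P,\pi)$ be a finite irreducible chain. Let $\mu \in \mathscr{P}(\Omega)$. Let $(X_t)_{t \in \mathbb{Z}_+}$ be the discrete-time version of the chain. Let $T_1, T_2$ be independent $\mathbb{Z}_+$ valued random variables independent of $(X_t)_{t \in \mathbb{Z}_+}$. Then $\|\mathrm{P}_\mu[X_{T_1+T_2} \in \cdot] - \pi\|_{\mathrm{TV}} \le \|\mathrm{P}_\mu[X_{T_1} \in \cdot] - \pi\|_{\mathrm{TV}}$, where

\[ \mathrm{P}_\mu[X_{T_1} = y] := \sum_t \mathbb{P}[T_1 = t]\,\mathrm{P}_\mu[X_t = y] \quad\text{and}\quad \mathrm{P}_\mu[X_{T_1+T_2} = y] := \sum_t \mathbb{P}[T_1 + T_2 = t]\,\mathrm{P}_\mu[X_t = y]. \]

Proof of Proposition 1.8: Fix some $t > 0$ and $0 < s \le \sqrt{t}$. Denote $r := t + s\sqrt{t}$. We first prove (1.10). In the notation of the standard coupling, $N_L(r) \sim \mathrm{Poisson}(2r)$ and

\[ \mathrm{H}_\mu^r = \sum_{k \ge 0} \mathbb{P}[N_L(r) = k]\,\mathrm{P}_{L,\mu}^k. \]

By the triangle inequality, together with (3.1) and the fact that $\|\mathrm{P}_{L,\mu}^k - \pi\|_{\mathrm{TV}}$ is non-increasing in $k$ and bounded by 1,

\[ \begin{aligned} \|\mathrm{H}_\mu^r - \pi\|_{\mathrm{TV}} &\le \sum_{k \ge 0} \mathbb{P}[N_L(r) = k]\,\|\mathrm{P}_{L,\mu}^k - \pi\|_{\mathrm{TV}} \le \mathbb{P}[N_L(r) < 2t] + \sum_{k \ge 2t} \mathbb{P}[N_L(r) = k]\,\|\mathrm{P}_{L,\mu}^k - \pi\|_{\mathrm{TV}} \\ &\le \exp\Big(-\frac{4s^2 t}{2(2t + 2s\sqrt{t})}\Big) + \|\mathrm{P}_{L,\mu}^{2t} - \pi\|_{\mathrm{TV}} \le d_L(2t,\mu) + e^{-s^2/2}, \end{aligned} \]

where in the last inequality we have used the assumption that $s \le \sqrt{t}$. This concludes the proof of (1.10). We now prove the first line in (1.9). We omit the second line in (1.9) as its proof is analogous and as it essentially appears in [9, Lemma 2.3].

As above, denote $r := t + s\sqrt{t}$. Let $Y \sim \mathrm{Poisson}(2r)$. Let $Z_1$ be a random variable whose conditional distribution, given that $Y = n$, is $\mathrm{Bin}((n-1) \vee 0, 1/2)$. Let $\eta$ be a Bernoulli random variable with mean 1/2, independent of $Z_1$ and $Y$. Set $Z := Z_1 + \eta 1_{Y > 0}$. Let $(X_t)_{t \in \mathbb{Z}_+}$ be the discrete-time version of the chain with $X_0 \sim \mu$. Pick $(Y, Z_1)$, $\eta$ and $(X_t)_{t \in \mathbb{Z}_+}$ to be jointly independent. Note that the conditional distribution of $Z$, given that $Y = n$, is $\mathrm{Bin}(n,1/2)$. Hence by Poisson thinning $Z \sim \mathrm{Poisson}(r)$ and so $X_r^{c} \sim X_Z$.

Let $T := t + \eta$. Then $Z = (T + Z_1 - t)1_{Y > 0}$. Thus $Z 1_{Z_1 \ge t} = (T + (Z_1 - t)^+)1_{Z_1 \ge t}$, where $a^+ := a \vee 0$ (since $Z_1 \ge t$ implies that $Y > 0$ and $Z_1 - t = (Z_1 - t)^+$). Consequently,

\[ \|\mathrm{P}_\mu(X_Z \in \cdot) - \mathrm{P}_\mu(X_{T + (Z_1 - t)^+} \in \cdot)\|_{\mathrm{TV}} \le \|Z - [T + (Z_1 - t)^+]\|_{\mathrm{TV}} \le \mathbb{P}[Z_1 < t]. \tag{4.1} \]

By (3.1) and the assumption $s \le \sqrt{t}$,

\[ \mathbb{P}[Z_1 < t] \le \mathbb{P}[Z \le t] \le \exp\Big(-\frac{s^2 t}{2(t + s\sqrt{t})}\Big) \le e^{-s^2/4}. \tag{4.2} \]

Finally, by Claim 4.1, in conjunction with (4.1)-(4.2), we get that

\[ \begin{aligned} d_c(t + s\sqrt{t},\mu) = \|\mathrm{P}_\mu[X_r^{c} \in \cdot] - \pi\|_{\mathrm{TV}} &\le \|\mathrm{P}_\mu(X_Z \in \cdot) - \mathrm{P}_\mu(X_{T + (Z_1 - t)^+} \in \cdot)\|_{\mathrm{TV}} + \|\mathrm{P}_\mu(X_{T + (Z_1 - t)^+} \in \cdot) - \pi\|_{\mathrm{TV}} \\ &\le e^{-s^2/4} + \|\mathrm{P}_\mu(X_T \in \cdot) - \pi\|_{\mathrm{TV}} = d_{\mathrm{ave}}(t,\mu) + e^{-s^2/4}. \end{aligned} \]

□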

5 Proof of Theorem 1.3 


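Assume that there is a continuous-time cutoff with a window $w_n$. Fix some $0 < \varepsilon < 1/4$. By Propositions 1.5 (first inequality) and 1.12 (second inequality)

\[ t_{\mathrm{ave}}^{(n)}(\varepsilon) \le [\ldots] \le t_c^{(n)}(\varepsilon/2) + C_2(\varepsilon)w_n. \]

By Propositions 1.8 (first inequality) and 1.12 (second inequality) we have that

\[ -t_{\mathrm{ave}}^{(n)}(1-\varepsilon) \le [\ldots] \le -t_c^{(n)}(1-\varepsilon/2) + C_4(\varepsilon)w_n. \]

Hence

\[ t_{\mathrm{ave}}^{(n)}(\varepsilon) - t_{\mathrm{ave}}^{(n)}(1-\varepsilon) \le t_c^{(n)}(\varepsilon/2) - t_c^{(n)}(1-\varepsilon/2) + C_5(\varepsilon)w_n \le C_6(\varepsilon)w_n, \]

as desired. Now assume that the sequence of averaged chains exhibits a cutoff with a window $w_n$. By Proposition 1.8

\[ t_c^{(n)}(\varepsilon) \le t_{\mathrm{ave}}^{(n)}(\varepsilon/2) + C_7(\varepsilon)\sqrt{t_{\mathrm{ave}}^{(n)}}. \]

By Proposition 1.5 we have that

\[ -t_c^{(n)}(1-\varepsilon) \le -t_{\mathrm{ave}}^{(n)}(1-\varepsilon/2) + C_8(\varepsilon)\sqrt{t_{\mathrm{ave}}^{(n)}}. \]

Hence

\[ t_c^{(n)}(\varepsilon) - t_c^{(n)}(1-\varepsilon) \le t_{\mathrm{ave}}^{(n)}(\varepsilon/2) - t_{\mathrm{ave}}^{(n)}(1-\varepsilon/2) + C_9(\varepsilon)\sqrt{t_{\mathrm{ave}}^{(n)}} \le C_{10}(\varepsilon)\Big(w_n \vee \sqrt{t_{\mathrm{ave}}^{(n)}}\Big), \]

as desired. □

6 Example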


In this section we consider an example which demonstrates that the assertions of Theorems 1.1 and 1.2 and of Proposition 1.5 are in some sense nearly sharp. For notational convenience we suppress the dependence on $n$ in some of the notation below. Throughout this section we write $c_0, c_1, c_2, \ldots$ for positive absolute constants, which are sufficiently small to guarantee that a certain inequality holds.

Equation (6.1) below resembles our main results apart from the fact that below the direction of the inequality is reversed, and the exponent of $s$ in the error term of the middle term in (6.1) (which decays like an inverse polynomial in $s$) is larger (compared to the corresponding exponent in Theorem 1.1; similarly, the error term on the RHS of (6.1) is similar to the one appearing in Theorem 1.2, that is to $\psi_{\alpha,C_2}(d_c(t)) - d_c(t)$).

Example 6.1. Fix some $0 < \alpha < 1/2$. Let $n \in \mathbb{N}$ be such that $s = s_{n,\alpha} := [\ldots] \ge 2$. Consider a nearest-neighbor random walk on the interval $\{0,1,2,\ldots,2n+1\}$, with a bias towards state $2n+1$, whose transition matrix is given by $P(0,1) = 1$, $P(2n+1,2n) = 1 - [\ldots]$,

\[ P(i,i) = \begin{cases} [\ldots] & i \ge 2n-2s, \\ 0 & \text{otherwise.} \end{cases} \]

Finally, $P(i,i+1) = 3P(i,i-1)$ for all $1 \le i \le 2n$ and is given by

\[ P(i,i+1) = \begin{cases} [\ldots] & i \ge 2n-2s, \\ 3/4 & \text{otherwise.} \end{cases} \]

By Kolmogorov's cycle condition, this chain is reversible. Both the sequence of the associated continuous-time chains and the sequence of the associated averaged chains exhibit cutoff around time $4n$ with a cutoff window of size $\sqrt{n}$. In particular, prior to time $4n - s$ the worst-case total variation distance from stationarity of both chains tends to 1 as $n$ tends to infinity. Moreover, it is not hard to show that

\[ d_c(4n+s) = (1 \pm o(1))\,\mathrm{H}_0[T_{2n+1} > 4n+s] \le [\ldots]. \]

Conversely, we now show that for $t = 4n + s$, we have that

\[ d_{\mathrm{ave}}(t+s) \ge d_c(t) + [\ldots] \ge d_c(t) + \frac{[\ldots]}{[\log(1/d_c(t))]^{[\ldots]}}. \tag{6.1} \]

The second inequality in (6.1) follows from the choice $s = [\ldots]$ together with $d_c(t) = d_c(4n+s) \le [\ldots]$. We now prove the first inequality in (6.1).

Consider the sets $\mathrm{Even} := \{2i : 0 \le i \le n\}$, $\mathrm{Odd} := \{2i+1 : 0 \le i \le n\}$ and $B := \{i : i \ge 2n-2s\}$. It is easy to see that $\pi(B) \ge 1 - [\ldots]$ and that

\[ 0 \le \pi(\mathrm{Even}) - 1/2 \le [\ldots]. \tag{6.2} \]

In order to prove (6.1), we shall show that

\[ A_{t+s}(0,\mathrm{Even}) \ge \frac{1}{2} + [\ldots]. \tag{6.3} \]
Let $(X_k)_{k=0}^{\infty}$ be the discrete-time chain with $X_0 = 0$. Note that $T_{2n-2s}$ is even, deterministically. If both $X_{4n+2s}$ and $X_{4n+2s+1}$ lie in $B$, we define

\[ T := \min\{k : T_{2n-2s} \le k \le 4n+2s \text{ and } X_\ell \in B \text{ for all } k \le \ell \le 4n+2s+1\}. \]

Otherwise, set $T = 0$. It is easy to see that $\mathbb{P}[T = 0] \le [\ldots]$ and that

\[ \tfrac{1}{2}\mathrm{P}_0[X_{4n+2s} \in \mathrm{Even} \mid T = 0] + \tfrac{1}{2}\mathrm{P}_0[X_{4n+2s+1} \in \mathrm{Even} \mid T = 0] = 1/2. \tag{6.4} \]

Moreover, conditioned on $T > 0$, the number of returns to state $2n-2s$ by time $4n+2s$ has an exponential tail. Using this fact, it is not hard to verify that

\[ \begin{aligned} \min_{0 \le r \le 4s} \mathbb{P}[T \text{ is even} \mid T \ne 0,\ 4n+2s-T_{2n-2s} = 2r] &\ge 1 - [\ldots], \\ \mathbb{P}[4n+2s-T_{2n-2s} > 8s \mid T \ne 0] &\le [\ldots]. \end{aligned} \tag{6.5} \]

Consider the projected chain $(Y_k)_{k \ge 0}$ (conditioned on $T \ne 0$) on $\{\pm 1\}$ defined via $Y_k := 1_{X_{T+k} \in \mathrm{Even}} - 1_{X_{T+k} \in \mathrm{Odd}}$. This is a two-state chain whose transition matrix is given by $[\ldots]$, where $\lambda := [\ldots]$. Using the spectral decomposition it is easy to verify that $A_k(1,1) = \frac{1}{2} + [\ldots]$. Note that if $k \le 8s$ then for even $k$'s we have that $0 \le A_k(1,1) - \frac{1}{2} = O(s^{-[\ldots]})$ and for odd $k$'s $0 \le \frac{1}{2} - A_k(1,1) = O(s^{-[\ldots]})$.

Applying this for $k = r$ when $T = 4n+2s-r > 0$, in conjunction with (6.4)-(6.5) yields (6.3) by averaging over $4n+2s-T$ and bounding separately the contribution of all even times (i.e. $4n+2s-T = 2k$, $k \le 4s$) and of all odd times, which are bounded from above by $8s$. We leave the details as an exercise.